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Abstract. This paper considers the empirical spectral measure of a power of a random 
matrix drawn uniformly from one of the compact classical matrix groups. We give sharp 
bounds on the Lp-Wasserstein distances between this empirical measure and the uniform 
measure on the circle, which show a smooth transition in behavior when the power increases 
and yield rates on almost sure convergence when the dimension grows. Along the way, we 
prove the sharp logarithmic Sobolev inequality on the unitary group. 



1. Introduction 

The eigenvalues of large random matrices drawn uniformly from the compact classical 
groups are of interest in a variety of fields, including statistics, number theory, and math- 
ematical physics; see e.g. [7] for a survey. An important general phenomenon discussed at 
length in [7] is that the eigenvalues of an x random unitary matrix U, all of which 
lie on the circle 8:^ = {z £ C : \z\ = 1}, are typically more evenly spread out than 
independently chosen uniform random points in It was found by Rains [15] that the 
eigenvalues of U'^ are exactly distributed as independent random points in S^; similar 
results hold for other compact Lie groups. In subsequent work [16], Rains found that in a 
sense, the eigenvalues of C/™ become progressively more independent as m increases from 1 
to N. 

In this paper we quantify in a precise way the degree of uniformity of the eigenvalues of 
[7™- when U is drawn uniformly from any of the classical compact groups U(A^), SU(A^), 
O (A^), SO (A^), and Sp {2N). We do this by bounding, for any p > 1, the mean and tails 
of the Lp-Wasserstein distance Wp between the empirical spectral measure fiN,m of f/™" and 
the uniform measure u on (see Section H] for the definition of Wp). In particular, we show 
in Theorem 1111 that 



log(^)+l 



'm 

(1) EWp{fiN,^,i^) <Cp ^ 

Theorem [THl gives a subgaussian tail bound for VFp(//Ar,m) which is used in Corollary [H] 
to conclude that if m = m{N), then with probability 1, for all sufficiently large A^, 



(2) M^,(/..,.,.)<Cp- V'^^^^ 



^min{l,l/2+l/p} ■ 

In the case m = 1, we obtain that the distance between the empirical spectral measure 
o f a u niform N x N random matrix and the uniform measure is at most of the order 
. The distance between and any probability measure supported on A^ points is at 
least cN"^, so that ([T]) and ([2]) are in this case best possible up to the logarithmic factor. 
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When m = N, Rains's theorem says that hn,n is the empirical measure of A'^ independent 
uniformly distributed points in S^, for which the estimate in ([T|) of order A^~^/^ is optimal (cf. 
[6]). We conjecture that Theorem 1111 and Corollary 1141 which interpolate naturally between 
these extreme cases, are optimal up to logarithmic factors in their entire parameter space. 

In the case that m = 1 and p = 1, these results improve the authors' earlier results in 
|14j (where Wi{ij,n^i, v) was bounded above by CN~'^^^) to what we conjectured there was 
the optimal rate; the results above are completely new for m > 1 or p > 1. 

The proofs of our main results rest on three foundations: the fact that the eigenvalues 
of uniform random matrices are determinantal point processes, Rains's representation from 
|16j of the eigenvalues of powers of uniform random matrices, and logarithmic Sobolev 
inequalities. In Section [2l we combine some remarkable properties of determinantal point 
processes with Rains's results to show that the number of eigenvalues of U"^ contained 
in an arc is distributed as a sum of independent Bernoulli random variables. In Section 
131 we estimate the means and variances of these sums, again using the connection with 
determinantal point processes. In Section HI we first generalize the method of Dallaporta 
[5] to derive bounds on mean Wasserstein distances from those data and prove Theorem 
[TTJ Then by combining Rains's results with tensorizable measure concentration properties 
which follow from logarithmic Sobolev inequalities, we prove Theorem [13] and Corollary 
[m We give full details only for the case of U(A^), deferring to Section [5] discussion of the 
modifications necessary for the other groups. 

In order to carry out the approach above, we needed the sharp logarithmic Sobolev 
inequality on the full unitary group, rather than only on SU (A^) as in [13] . It has been 
noted previously (e.g. in [IOl|T]) that such a result is clearly desirable, but that because the 
Ricci tensor of U (A^) is degenerate, the method of proof which works for SU (A^), SO (A^), 
and §p (2A^) breaks down. In the appendix, we prove the logarithmic Sobolev inequality 
on U (A^) with a constant of optimal order. 

2. A MIRACULOUS REPRESENTATION OF THE EIGENVALUE COUNTING FUNCTION 

As discussed in the introduction, a fact about the eigenvalue distributions of matrices 
from the compact classical groups which we use crucially is that they are determinantal 
point processes. For background on determinantal point processes the reader is referred to 
[llj . The basic definitions will not be repeated here since all that is needed for our purposes 
is the combination of Propositions [T] and [5] with Proposition [2] and Lemma [6] below. The 
connection between eigenvalues of random matrices and determinantal point processes has 
been known in the case of the unitary group at least since [8]. For the other groups, the 
earliest reference we know of is [12]. Although the language of determinantal point processes 
is not used in [12], Proposition [1] below is essentially a summary of [HI Section 5.2]. We 
first need some terminology. 

Given an eigenvalue e*^, < < 2ir, of a unitary matrix, we refer to 9 as an eigenvalue 
angle of the matrix. Each matrix in SO (2A^ + 1) has 1 as an eigenvalue, each matrix in 
SO" (2A^ + 1) has —1 as an eigenvalue, and each matrix in SO" (2A^ + 2) has both —1 and 
1 as eigenvalues; we refer to all of these as trivial eigenvalues. Here SO~ (A^) = {U £ 
O (N) : det U = —1}, which is considered primarily as a technical tool in order to prove our 
main results for O (A^). The remaining eigenvalues of matrices in SO (A^) or Sp (2A'^) occur 
in complex conjugate pairs. When discussing SO(A^), SO~ (A^), or Sp(2A^), we refer to 
the eigenvalue angles corresponding to the nontrivial eigenvalues in the upper half-circle as 
nontrivial eigenvalue angles. For ILJ(A^), all the eigenvalue angles are considered nontrivial. 
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Proposition 1. The nontrivial eigenvalue angles of uniformly distributed random matrices 
in any of SO (N), SO~ (N), U (A^), Sp (2A^) are a determinantal point process, with respect 
to uniform measure on A, with kernels as follows. 





KN{x,y) 


A 


SO (2iV) 


N-l 

1 + 2cos(jx) cos(jy) 


[0,vr) 


SO(2iV + l),SO" (2iV + l) 




[Cvr) 


U(iV) 


N~l 
j=0 


[0, 2tt) 


Sp(2iV),SO" (2iV + 2) 


N 

^2sin(jx)sin(jy) 


[Cvr) 



Proposition [T] allows us to apply the following result from [TT]; see also Chapter 4 of [T]. 

Proposition 2. Let K : Ax A ^ C be a kernel such that the corresponding integral operator 
% : L'^in) — > L'^ilJ') defined by 

%{f){x) = I K{x,y)f{y) dfi{y) 

is self-adjoint, nonnegative, and locally trace-class with eigenvalues in [0,1]. For D Q A, 
let KD{x,y) = lD{x)K{x,y)lDiy) be the restriction of K to D, and denote by Xj. the 
eigenvalues of the corresponding operator %£, on L^{D). Denote by 'Hd the number of 
particles of the determinantal point process with kernel K which lie in D. Then 

k 

where "=" denotes equality in distribution and the .^^ are independent Bernoulli random 
variables with P[^fc = 1] = and P[,^fc = 0] = 1 — A^. 

In order to treat powers of uniform random matrices, we will make use of the following 
elegant result of Rains. For simplicity of exposition, we will restrict attention for now to the 
unitary group, and discuss in Section [5] the straightforward modifications needed to treat 
the other classical compact groups. 

Proposition 3 (Rains, [16]). Let m < N be fixed. If ~ denotes equality of eigenvalue 
distributions, then 

u(ivr~ u 

0<j<m 

That is, if U is a uniform N x N unitary matrix, the eigenvalues of are distributed as 
those of m independent uniform unitary matrices of sizes |_^J and |"^] , such that the sum 
of the sizes of the matrices is N . 
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Corollary 4. Let 1 < m < N , and let U € U (n) be uniformly distributed. For 6 € [0, 27r), 

denote by N^™^ the number of eigenvalue angles of U"^ which lie in [0,0). Then !N"g"^^ is 
equal in distribution to a sum of independent Bernoulli random variables. Consequently, 
for each t > 0, 



where = Var . 

Proof. By Proposition O X^™"^ is equal to the sum of m independent random variables 
Xi, 1 < i < m, which count the number of eigenvalue angles of smaller-rank uniformly 
distributed unitary matrices which lie in the interval. Propositions [1] and [2] together imply 
that each Xi is equal in distribution to a sum of independent Bernoulli random variables, 
which completes the proof of the first claim. The inequality ([3]) then follows immediately 
from Bernstein's inequality [W\ Lemma 2.7.1]. □ 



In order to apply ([3]) , it is necessary to estimate the mean and variance of the eigenvalue 
counting function Jsfg'"'* . As in the proof of Corollary HI this reduces by Proposition [3] to 
considering the case m = 1. Asymptotics for these quantities have been stated in the 
literature before, e.g. in [18], but not with the uniformity in 6 which is needed below, so we 
indicate one approach to the proofs. A different approach yielding very precise asymptotics 
was carried out by Rains [15] for the unitary group; we use the approach outlined below 
because it generalizes easily to all of the other groups and cosets. 

For this purpose we again make use of the fact that the eigenvalue distributions of these 
random matrices are determinantal point processes. It is more convenient for the variance 
estimates to use here an alternative representation to the one stated in Proposition [Tj (which 
is more convenient for verifying the hypotheses of Proposition [2] and for the mean estimates). 
First define 



The following result essentially summarizes \12\ Section 5.4]. (Note that in the unitary case, 
the kernels given in Propositions [1] and [5] are not actually equal, but they generate the same 
process). 

Proposition 5. The nontrivial eigenvalue angles of uniformly distributed random matrices 
in any of SO (N), SO~ (N), U (A^), Sp (2A^) are a determinantal point process, with respect 
to uniform measure on A, with kernels as follows. 



(3) 




3. Means and variances 




sin (^)/ sin (f) ifx/0, 
N if X = 0. 
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KN{x,y) 


A 


SO (2iV) 


^ {S2N-i{x -y) + S2N-i{x + y)^ 




[0,^) 


SO (2iV + 1) , SO" (2iV + 1) 


^ (^S2n{x -y) - S2n{x + yfj 


[0,^) 


U(iV) 


Sn{x - y) 


[0, 27r) 


Sp(2iV),SO~ (2iV + 2) 


^ {S2N+i{x -y) - S2N+i{x + y)^ 




[0,^) 



The following lemma is easy to check using Proposition [2l For the details of the variance 
expression, see [9l Appendix B]. 

Lemma 6. Let K : I x I ^ be a continuous kernel on an interval I representing an 
orthogonal projection operator on L?'{ii), where fi is the uniform measure on I. For a 
subinterval D <^ I, denote by 'Nu the number of particles of the determinantal point process 
with kernel K which lie in D. Then 

EXd = / K{x, x) dn{x) 
Jd 

and 

VarNB= / / K{x,y)^ dfiix) d^iiy). 

JD JI\D 



Proposition 7. (1) Let U be uniform in V (N). For 9 G [0,27r), let 'Ng be the number 
of eigenvalues angles of U in [0, 9) . Then 

(2) Let U be uniform in one of SO (2iV), SO" (2A^ + 2), SO (27V + 1), SO" {2N + I), 
or Sp(2A^). For 9 £ [0,7r), let JsTg be the number of nontrivial eigenvalue angles of 
U in [0,6*). Then 



TT 



< 1. 



Proof. The equality for the unitary group follows from symmetry considerations, or imme- 
diately from Proposition [5] and Lemma [6l 

In the case of Sp (2iV) or SO" {2N + 2), by Proposition [U and Lemma EJ 



2 sin^ (jx) dx 



N 



N9 1 •^sm{2j9) 



IT 2tt 



i=i 



Define oq = and aj = X^^^ sm{2k9). Then by summation by parts. 



N 

E 



sm{2j9) ajv 



N-l 
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Trivially, |aAr| < N. Now observe that 

2i6» e - i 



Qj = Im 



k-l 
2iB \^ 2ije 



j=0 



Im 



Im 



1)0 smjje) 
sm{9) 



sm{{j + l)6l)sin(j6') 
sm{e) 



Since Uj is invariant under the substitution i— )• tt — O, it suffices to assume that < 6 < Tr/2. 
In that case sin(0) > 26 /tt, and so 



N-l 

E 



All together, 



< 



TT 

29 



E ''^+ E 

i<i<i/e 



1 



EJ<g 

TT 



l/e<j<N~l 

1 + vr 



j{j + 1) 



vr. 



< 



27r 



The other cases are handled similarly. 



□ 



As before, we restrict attention from now on to the unitary group, deferring discussion 
of the other cases to Section [5j 

Proposition 8. Let U be uniform in U(A^). For 9 G [0,27r), let J^g be the number of 
eigenvalue angles of U in [0,9). Then 

Yavyig < logiV + 1. 

Proof. If ^ € (7r,27r), then J^g = N — ^^2^-61 and so it suffices to assume that 9 < ir. By 
Proposition [5] and Lemma [6l 



1 


/ Sn{x - 
Je 


4^ 


Jo sin2 (1) 


1 




re zsiiP{^) 


2^ 


Jo sin^ (1) 



1 



4^2 



Je~y 



sm 



2 



dz dy 



dz + 



dz + 



2-K-d o^:,-,2 I Nz 



sm 



2 (^z 
2 (Nz 



sm- 1 f ) 



dz + 



2tt 



(27r-z)sin2 



27r- 



sm 



Mi) 



sin2 (I) 



For the first integral, since sin (|) > ^ for all 2 G [0, if > then 



z sm 



2 ( Nz 



Nz \ 
2 y 



sin2 (f ) 



{-nNfz 



%2 



J_ z 

AT 



dz+ — dz = TT^[- + log(iV) - log(0) 



Ife < i, there is no need to break up the integral and one simply has the bound 



< ^ 



Similarly, \{ 9 < -i, then 



' sm 



sm 



2 I z 



f) 



dz < 



9{'nNf 



dz + 



J_ z^ 

AT 



dz 



^il-N9) + .^N9-.9<'A 
4 4 



if 6* > i, there is no need to break up the integral and one simply has a bound of tt 
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All together, 



VarXg < log(iV) + 



11 
16' 



□ 



Combining Propositions [3l [71 and [HI one obtains the following. (The N/m in the variance 
bound, as opposed to the more obvious \N/rn\ , follows from the concavity of the logarithm.) 

Corollary 9. Let U he uniform in U {N) and 1 < m < N . For 6 G [0, 2tt), let ?sfg™^ be the 
number of eigenvalue angles of in [0,^). Then 



and 



VarN 



(m) 



, , , N 

< m log — 

m 



4. Wasserstein distances 

In this section we prove bounds and concentration inequalities for the spectral measures 
of fixed powers of uniform random unitary matrices. The method generalizes the approach 
taken in [5] to bound the distance of the spectral measure of the Gaussian unitary ensemble 
from the semicircle law. 

Recall that for p > 1, the Lp- Wasserstein distance between two probability measures fi 
and 1/ on C is defined by 

Wp{fj.,i') = I inf / |tf — d'K{w,z)\ , 

\n£U{^i,u) J J 

where n(^, i^) is the set of all probability measures on C x C with marginals fi and z/. 

Lemma 10. Let 1 < m < N and let U G U (A^) be uniformly distributed. Denote by e*^^ , 
1 ^ J ^ A, the eigenvalues of U"^, ordered so that < 9i < ■ ■ • < On < 2vr. Then for each 
j and u > 0, 

,2 



(4) 



27ri 
N 



47r 



< 4exp 



u 



mm 



(log(^)+l) 



m 



Proof. For each 1 < j < A and u > 0, if j + 2n < A then 



27r7 47r 



< 



27r(j + 2n) 
JV 

(m) 



r(m) 



27r(j+2u) 
JV 



EN 



27r(j + 2n) 



i + 2n - J^Q^ 
> 2u 



> 2u 



N 



lfj + 2u>N then 



27r7' Att 



' [Oj > 27r] = 0, 



27rj _ 4n 



and the above inequality holds trivially. The probability that 9j < -j^ — 
the same way. Inequality ((H) now follows from Corollaries [H and [9l 



u is bounded in 

□ 



Theorem 11. Let ^N,m ihe spectral measure ofU^, where 1 < m < N and U £ U ( A) 
is uniformly distributed, and let v denote the uniform measure on Then for each p > 1, 



EWp(/i7v,„,i/) < Cp 



"^[log(^) + l 
A^ 



where C > is an absolute constant. 
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Proof. Let 9j be as in Lemma [TOl Then by Fubini's theorem, 



E 



2vrj 

N 



P poo 



< 





(47r)Pp 

NP 
4(47r)Pp 
NP 

4(47r)P 
NP 



N 



> t 

_ 27rj 
n 



dt 



P/2 















log 








\m J 



An 



<8T{p+l)\—Jm 



log ( - ) + 1 

m 



Observe that in particular, 



Var^j < C- 



Ar2 



Let i^N be the measure which puts mass jj- at each of the points e^'^*-'/^, 1 < i < A^. 
Then 



1 ^ 



< E 



JV 



-y 

N ^ 



Att 



<8T{p+l) yj^xjm 



log ( - ) + 1 

m 



p 



' N 



It is easy to check that Wp{i'N,i') < ^. Together with Holder's inequality, the triangle 
inequality for Wp, and Stirling's formula, this completes the proof. □ 



In the case that m = 1 and p < 2, Theorem 1111 could now be combined with Corollary 2.4 
and Lemma 2.5 from [T3] in order to obtain a sharp concentration inequality for Wp{fi]y^i, u). 
However, for m > 1 we did not prove an analogous concentration inequality for Wp{^N,m-, ^) 
because the main tool needed to carry out the approach taken in [14], specifically, a loga- 
rithmic Sobolev inequality on the full unitary group, was not available. The appendix to 
this paper contains the proof of the necessary logarithmic Sobolev inequality on the unitary 
group (Theorem lisp and the approach to concentration taken in [14], in combination with 
Proposition O can then be carried out in the present context. 

The following lemma, which generalizes part of [14^ Lemma 2.3], provides the necessary 
Lipschitz estimates for the functions to which the concentration property will be applied. 

Lemma 12. Let p > 1. The map A i— )• taking an N x N normal matrix to its spectral 
measure is Lipschitz with constant A^~i/™^^{P'2} with respect to Wp. Thus if p is any fixed 
probability measure on C, the map A i— )• Wp{iJ,A, p) is Lipschitz with constant N~^/ ^^^^P''^\ 
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Proof. If A and B are N x N normal matrices, then the Hoffman- Wielandt inequahty O 
Theorem VI. 4.1] states that 

N 

(5) 



mm Y,\Xj{A) - X„(^^^{B)\' <\\A-B\ 



2 

HS ' 



where Ai(^), . . . , XnIA) and Xi{B), . . . , Xn{B) are the eigenvalues (with multiplicity, in any 
order) of A and B respectively, and Sjy is the group of permutations on N letters. Defining 
couplings of HA and fi b given by 

1 ^ 



for 0" G Sjv, it follows from ([5]) that 



%(/iA,^B) < mm 1^1^^ |A,(^) - A.(,)(i?)|^j 



i/p 



N 



1/2 



□ 



Theorem 13. Let fiN,m be the empirical spectral measure of where U G U(iV) is 
uniformly distributed and 1 < m < N , and let v denote the uniform probability measure on 
§^ . Then for each t > 0, 



[log(f)+l 



m 



N 



+ t 



< exp 



2+2n 



for 1 < p <2 and 



WpiflN,,!!,!^) > Cp 



^ [log (^) + 1 



+ t 



< 



exp 



24m 



^l+2/p^2 

24m 



for p > 2, where C > is an absolute constant. 

Proof. By Proposition [3l fj.N,m is equal in distribution to the spectral measure of an x 
random matrix C/i • • • Um, where the Uj are independent and uniform in U(|_^J) 
and U([f]). Identify 

/^A'",m with this measure and define the function F(Ui, . . . , Um) — 
Wp{fiN,m,'^)- Observing that |_^J > Corollarv 1171 combined with the mean estimate 
in Theorem [11] and the Lipschitz estimates of Lemma [T2l completes the proof. □ 



Corollary 14. Suppose that for each N, Un G U {N) is uniformly distributed and 1 < 
TUN < N . Let V denote the uniform measure on S^. There is an absolute constant C such 
that given p > 1, with probability 1, for all sufficiently large N , 



Wp{nN 



Y^mjv log(iV) 
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if ^ ^ P ^ '2' and 



ifp > 2. 

Proof. In Theorem [13] let = 5^^^^5M for p g [1, 2] and tj, = 5^^^^^^ for p > 2, 
and apply the Borel-Cantelli lemma. □ 



As noted earlier, by Rains's Proposition[3]above, in the case m = N, fi^^m is the empirical 
measure of i.i.d. points on By another result of Rains [15], the same is true when 
m > N. In particular, in all the above results the restriction m < N may be removed if m 
is simply replaced by min{m, N} in the conclusion. 



5. Other groups 

The approach taken above can be completed in essentially the same way for SO (A), 
§0~ (A) and Sp (2A^), so that all the results above hold in those cases as well, with only 
the precise values of constants changed. 

In [16], Rains proved that the eigenvalue distributions for these groups (or rather, compo- 
nents, in the case of SO~ (A)) can be decomposed similarly to the decomposition described 
in Proposition [3l although the decompositions are more complicated in those cases (mostly 
because of parity issues). The crucial fact, though, is that the decomposition is still in 
terms of independent copies of smaller-rank (orthogonal) groups and cosets. This allows for 
the representation of the eigenvalue counting function in all cases as a sum of independent 
Bernoulli random variables (allowing for the application of Bernstein's inequality) and as 
a sum of independent copies of eigenvalue counting functions for smaller-rank groups. In 
particular the analogue of Corollary [J] holds and it suffices to estimate the means and vari- 
ances in the case m = 1. The analogue of Proposition [8] for the other groups can be proved 
similarly using Proposition [5] and Lemma [6l 

With those tools and Proposition [7] on hand, the analogue of Theorem [11] can be proved 
in the same way, with a minor twist. One can bound as in the proof of Theorem [11] the 
distance between the empirical measure associated to the nontrivial eigenvalues and the 
uniform measure on the upper-half circle. Since the nontrivial eigenvalues occur in complex 
conjugate pairs and there are at most two trivial eigenvalues, one gets essentially the same 
bound for the distance between the empirical spectral measure and the uniform measure on 
the whole circle. 

Finally, logarithmic Sobolev inequalities — and hence concentration results analogous to 
Corollary [T7I — for the other groups are already known via the Bakry-Emery criterion, cf. 
[H Section 4.4], so that the analogue of Theorem [T3l follows as for the unitary group. 

For the special unitary group SU(A), all the results stated above hold exactly as stated 
for the full unitary group, cf. the proof of [14^ Lemma 2.5]. Analogous results for the full 
orthogonal group O (A) follow from the results for SO (A) and SO~ (A^) by conditioning 
on the determinant, cf. the proofs of Theorem 2.6 and Corollary 2.7 in [14j . 
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Appendix: the log-Sobolev constant of the unitary group 

In this section we prove a logarithmic Sobolev inequality for the unitary group with a 
constant of optimal order. As a consequence, we obtain a sharp concentration inequality, 
independent of for functions of k independent unitary random matrices. 

Recall the following general definitions for a metric space (X, d) equipped with a Borel 
probability measure /u. The entropy of a measurable function / : X — )■ [0, oo) with respect 
to ^ is 

Ent^(/) := j /log(/) d^^-(^j f df^^ / dfi^ . 

For a locally Lipschitz function / : X — t- M, 

\Vg\ (x) := hmsup . 

y^x d{y,x) 

We say that {X, d, //) satisfies a logarithmic Sobolev inequality (or log-Sobolev inequality 
for short) with constant C > if, for every locally Lipschitz / : X — )• M, 

(6) Ent^(/2) <2cj |V/|2 df,. 

Theorem 15. The unitary group U (A^), equipped with its uniform probability measure and 
the Hilbert-Schmidt metric, satisfies a logarithmic Sobolev inequality with constant 6/N . 

If the Riemannian structure on U (A) is given the normalization induced by the usual 
Hilbert-Schmidt inner product on matrix space, the the geodesic distance is bounded above 
by 7r/2 times the Hilbert-Schmidt distance on U ( A) (see e.g. [H Lemma 3.9.1]). Thus 
Theorem [15] implies that U (A) equipped with the geodesic distance also satisfies a log- 
Sobolev inequality, with constant 37r^/2A. 

It is already known that every compact Riemannian manifold, equipped with the normal- 
ized Riemannian volume measure and geodesic distance, satisfies a log-Sobolev inequality 
with some finite constant [T7]. For applications like those in this paper to a sequence of 
manifolds such as {U(A)}^^-|^, however, the order of the constant as A grows is crucial. 
The constant in Theorem [15] is best possible up to a constant factor. 

The key to the proof of Theorem [15] is the following representation of uniform measure 
on the unitary group. 

Lemma 16. Let 9 be uniformly distributed in [O, ^] and let V € SU {N) be uniformly 
distributed, with 9 and V independent. Then e^^V is uniformly distributed in ILJ(A). 

Proof. Let X be uniformly distributed in [0, 1), K uniformly distributed in {0, . . . , — 1}, 
and V uniformly distributed in SU ( A) with {X,K,V) independent. Consider 

U = ^'^■^iX / N ^2mK / N y 

On one hand, it is easy to see that {X + K) is uniformly distributed in [0, A^], so that 
^2-Ki(X+K)/N uniformly distributed on S-*^. Thus U = ojV for a; uniform in and inde- 
pendent of V . It is then straightforward to see that [/ € U (A) is uniformly distributed (cf. 
the proof of \\4\ Lemma 2.5]). 

On the other hand, if In is the iV x TV identity matrix, then c'^'^'^^^In G SU(iV). By 
the translation invariance of uniform measure on SU (A) this implies that e^'^*-^/^^ = V, 
and so e'^'^'^/^V = U. □ 
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Proof of TheorenillSi First, for the interval [0,27r] equipped with its standard metric and 
uniform measure, the optimal constant in ([6]) for functions / with /(O) = /(27r) is known 
to be 1, see e.g. [20]. This fact completes the proof — with a better constant than stated 
above — in the case = 1, since U(l) = S^; we assume from now on that N > 2. By 
reflection, the optimal constant for general locally Lipschitz functions on [0,7r] is also 1. It 



0, 



ttV2 



/N 



IS 



follows by a scaling argument that the optimal logarithmic Sobolev constant on 
2/N. 

By the Bakry-Emery Ricci curvature criterion [2], SU(A^) satisfies a log-Sobolev in- 
equality with constant 2/N when equipped with its geodesic distance, and hence also when 
equipped with the Hilbert-Schmidt metric (see Section 4.4 and Appendix F of [1]). By the 
tensorization property of log-Sobolev inequalities in Euclidean spaces (see [131 Corollary 

5.7]), the product space [o, ^) x§V{N), equipped with the L 2-sum metric, satisfies a 



log-Sobolev inequality with constant 2/N as well. 

Define the map F : \o, ^) x SU (iV) ^ U (iV) by F{t, V) = e^^^l^y . By Lemma 



with uniform 



[T6| the push-forward via F of the product of uniform measure on 
measure on SU(A^) is uniform measure on U(A^). Moreover, this map is -v/3-Lipschitz: 



HS 



< 



+ 

\Vi 



HS 



V2II F7<? + 



\HS 

^'^Whs + v^l^i 



v^iti/v^ T 



HS 



HS 



^2! 



V2\ 



HS 



+ \tl-t2\ 



Since the map F is -y/S-Lipschitz, its image U {N) with the (uniform) image measure 

^ □ 



satisfies a logarithmic Sobolev inequality with constant (\/3)^;^ 



Corollary 17. Given Ni,...,Nk G N, denote by M = U (A^i) x • • • U (Nk) equipped with 
the L2-sum of Hilbert-Schmidt metrics. Suppose that F : M ^ M is L-Lipschitz, and that 
{Uj G V (Nj) : I < j < k} are independent, uniform random unitary matrices. Then for 
each t > 0, 

F[F{Ui,...,Uk)>EF{Ui,...,Uk) + t] <e-^*'/i2^', 
where N = min{A^i, . . . , N/.}. 

Proof By Theorem [15] and the tensorization property of log-Sobolev inequalities \13\ Corol- 
lary 5.7], M satisfies a log-Sobolev inequality with constant The stated concentration 
inequality then follows from the Herbst argument (see, e.g., |13j . Theorem 5.3). □ 

The lack of dependence on /c is a crucial feature of the inequality in Corollary [T7] un- 
like logarithmic Sobolev inequalities, concentration inequalities themselves do not tensorize 
without introducing a dependence on k. 
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